PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_D01G1936
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 475aa    MW: 53950.6 Da    PI: 8.885
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_D01G1936genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix92.54.4e-2959143187
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                  rW++qe+laL+++r++m+ ++r+++ k+plWeevs+k++e g++rs+k+Ckek+en+ k++k++k+g++++   + +t+++ dqlea
  Gh_D01G1936  59 RWPRQETLALLKIRSDMDVTFREASVKGPLWEEVSRKLAELGYHRSAKKCKEKFENVYKYHKRTKDGRSGK--ADGKTYRFCDQLEA 143
                  8********************************************************************96..56668******985 PP

2trihelix102.53.3e-32316400186
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                  rW+k e+ aLi++r+++++++++++ k+plWee+s+ m++ g++r++k+Ckekwen+nk++kk+ke++k+r + +s+tcpyf+ql+
  Gh_D01G1936 316 RWPKVEIEALIKIRTSLDSKYQDNSPKGPLWEEISNEMKKLGYNRNAKRCKEKWENINKYFKKVKESNKQR-PVDSKTCPYFHQLD 400
                  8*********************************************************************8.9999********97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007175.7E-456118IPR001005SANT/Myb domain
PROSITE profilePS500906.91158116IPR017877Myb-like domain
PfamPF138372.5E-1858143No hitNo description
CDDcd122031.97E-2358123No hitNo description
SMARTSM007170.0011313375IPR001005SANT/Myb domain
CDDcd122035.65E-27315380No hitNo description
PfamPF138372.4E-22315401No hitNo description
PROSITE profilePS500907.131315373IPR017877Myb-like domain
Gene3DG3DSA:1.10.10.609.2E-4315372IPR009057Homeodomain-like
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 475 aa     Download sequence    Send to blast
MLGGGGTTAS VSSGGGCNGI NEAAAPVAVF DTNDGNSNNS GEDDRSKVDE RDRSFGGNRW  60
PRQETLALLK IRSDMDVTFR EASVKGPLWE EVSRKLAELG YHRSAKKCKE KFENVYKYHK  120
RTKDGRSGKA DGKTYRFCDQ LEAFQNQPSI HWPPPPPMAA AATINQSISA VQMSNSTSSS  180
TSSDLELQGR KKRKRKWKDF FERLMKEVIQ KQQVMQKTFL EAIEKHERER IVRDEAWKVQ  240
EMSRLNRERE ILAQERSIAA AKDAAIMAFL QKLSEKQNLG QSQNSPLPPP AVAPAAVAPP  300
PDNGNQIQTH TPSSSRWPKV EIEALIKIRT SLDSKYQDNS PKGPLWEEIS NEMKKLGYNR  360
NAKRCKEKWE NINKYFKKVK ESNKQRPVDS KTCPYFHQLD VLYREKNKHD CSSKSNPLMV  420
RPEKQWPPPL EPHQQHHDTI MEDMMESDQN DDEEEDEGGS YELVASKPVS MGTAE
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1189194RKKRKR
2189195RKKRKRK
3190195KKRKRK
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJQ0130920.0JQ013092.1 Gossypium hirsutum trihelix transcription factor (GT7) mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012468066.10.0PREDICTED: trihelix transcription factor GT-2-like
SwissprotQ391172e-82TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLA0A0D2MEV10.0A0A0D2MEV1_G
STRINGPOPTR_0005s21410.11e-160(Populus trichocarpa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM48492553
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.21e-71Trihelix family protein